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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 (i) APPLICANT: 

(B) NAME: CREATIVE BIOMOLCULES, INC 

(C) STREET: 45 SOUTH STREET 

(D) STATE: HOPKINTON 
10 (E) COUNTRY: USA 

X ( F ) POSTAL CODE (ZIP) : 01748 

(G) TELEPHONE: (508) -435-9001 

(H) TELEFAX: (508) -435-0992 

(I) TELEX: 

• (1) ^rS^UDWIG INSTITUTE FOR CANCER RESEARCH 

(C) STREET: 1345 AVENUE OF THE AMERICAS 

(D) STATE: NY 

20 (E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 10105 

(G) TELEPHONE: (212) -765-3000 

(H) TELEFAX: 

(I) TELEX: 



25 



30 



(ii) TITLE OF INVENTION: MORPHOGENY PROTEIN-SPECIFIC CELL 
SURFACE RECEPTORS AND USES THEREFOR 



(iii) NUMBER OF SEQUENCES: 15 



(iv) CORRESPONDENCE ADDRESS: mTOMT T7 fc 

(A) ADDRESSEE: PATENT ADMINISTRATOR, TESTA, HURWITZ & 

THIBEAULT 

(B) STREET: 53 STATE STREET 
35 (C) CITY: BOSTON 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02109 

40 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS . ^ 9 - 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

45 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

50 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: PITCHER, EDMUND R. 

(B) REGISTRATION NUMBER: 27,829 

(C) REFERENCE/DOCKET NUMBER: CRP-097PC 

55 (ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (617) 248-7000 

(B) TELEFAX: (617) 248-7100 



60 



• ... • 



10 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..1509 ; m 
15 (D) OTHER INFORMATION: /product= "Human ALKl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

20 ATG ACC TTG GGC TCC CCC AGG AAA GGC CTT CTG ATG CTG CTG ATG GCC 
Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Aia 
1 5 10 ; 15 

TTG GTG ACC CAG GGA GAC CCT GTG AAG CCG TCT CGG GGC CCG CTG GTG 
25 Leu Val Thr Gin Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val 

20 25 30 



CAC AAC GTG TCC CTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG 
45 His Asn Val Ser Leu Val Leu Glu Ala Thr Gin Pro Pro Ser Glu Gin 
100 105 11° 

CCG GGA ACA GAT GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG GCC 
Pro Gly Thr Asp Gly Gin Leu Ala Leu He Leu Gly Pro Val Leu Ala 
115 120 125 



48 



96 



ACC TGC ACG TGT GAG AGC CCA CAT TGC AAG GGG CCT ACC TGC CGG GGG 144 
■V Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly 

30 35 40 45 

S GCC TGG TGC ACA GTA GTG CTG GTG CGG GAG GAG GGG AGG CAC CCC CAG 192 

tn Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gin 

N 35 GAA CAT CGG GGC TGC GGG AAC TTG CAC AGG GAG CTC TGC AGG GGG CGC 240 
Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg 
65 70 75 ; 80 

40 CCC ACC GAG TTC GTC AAC CAC TAC TGC TGC GAC AGC CAC CTC TGC AAC 288 
Pro Thr Glu Phe Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn 
85 90 95 



336 



384 



TTG CTG GCC CTG GTG GCC CTG GGT GTC CTG GGC CTG TGG CAT GTC CGA 432 
Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg 
130 135 140 

CGG AGG CAG GAG AAG CAG CGT GGC CTG CAC AGC GAG CTG GGA GAG TCC 480 
Arg Arg Gin Glu Lys Gin Arg Gly Leu His Ser Glu Leu Gly Glu Ser 
145 150 155 160 

AGT CTC ATC CTG AAA GCA TCT GAG CAG GGC GAC ACG ATG TTG GGG GAC 528 
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10 



U! 



40 



Ser Leu lie Leu Lys Ala Ser Glu Gin Gly Asp Thr Met Leu Gly Asp 
165 170 

CTC CTG GAC AGT GAC TGC ACC ACA GGG AGT GGC TCA GGG CTC CCC TTC 
SS S Sp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe 
180 I 85 

CTG GTG CAG AGG ACA GTG GCA CGG CAG GTT GCC TTG GTG GAG TGT GTG 
Leu Val Gin Arg Thr Val Ala Arg Gin Val Ala Leu Val Glu cys vax 
195 200 



GGA AAA GGC CGC TAT GGC GAA GTG TGG CGG GGC TTG TGG CAC GGT GAG 
Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu 



CTA GGC TTC ATC GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG 
i= 25 Leu Gly Phe He Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gin 
T, 260 265 



CTG TGG CTC ATC ACG CAC TAC CAC GAG CAC GGC TCC CTC TAC GAC TTT 
Li Leu Trp Leu He Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe 

« 30 275 280 285 



60 



TTT GGC CTG GTG CTG .TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC 



576 



624 



672 



210 215 

" AGT GTG GCC GTC AAG ATC TTC TCC TCG AGG GAT GAA CAG TCC TGG TTC 720 

Ser Val Ala Val Lys He Phe Ser Ser Arg Asp Glu Gin Ser Trp Phe 
225 230 235 

5 20 CGG GAG ACT GAG ATC TAT AAC ACA GTA TTG CTC AGA CAC GAC AAC ATC 

Arg Glu Thr Glu He Tyr Asn Thr Val Leu Leu Arg, His Asp Asn lie 

— a 245 250 



768 



816 



864 



912 



CTG CAG AGA CAG ACG CTG GAG CCC CAT CTG GCT CTG AGG CTA GCT GTG 
Leu Gin Arg Gin Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val 
290 295 300 

" TCC GCG GCA TGC GGC CTG GCG CAC CTG CAC GTG GAG ATC TTC GGT ACA 960 
Ser Ala Ala Cys Gly Leu Ala His Leu His Val Glu He Phe Gly Thr 
305 310 3 " 



CAG GGC AAA CCA GCC ATT GCC CAC CGC GAC TTC AAG AGC CGC AAT GTG 1008 
Gin Gly Lys Pro Ala He Ala His Arg Asp Phe Lys Ser Arg Asn Val 
325 330 

CTG GTC AAG AGC AAC CTG CAG TGT TGC ATC GCC GAC CTG GGC CTG GCT 
45 Leu Val Lys Ser Asn Leu Gin Cys Cys He Ala Asp Leu Gly Leu Aia 
340 345 J3U 

GTG ATG CAC TCA CAG GGC AGC GAT TAC CTG GAC ATC GGC AAC AAC CCG 
Val Met His Ser Gin Gly Ser Asp Tyr Leu Asp He Gly Asn Asn Pro 
50 .355 360 365 

AGA GTG GGC ACC AAG CGG TAC ATG GCA CCC GAG GTG CTG GAC GAG CAG 
Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gin 
370 375 380 

" ATC CGC ACG GAC TGC TTT GAG TCC TAC AAG TGG ACT GAC ATC TGG GCC 1200 
He Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp He Trp Aia 
385 390 395 



1056 



1104 



1152 



1248 




61 



10 



yi 20 



O 30 



H 35 



45 



Phe Gly Leu Val Leu Trp Glu lie Ala Arg Arg Thr lie Val Asn Gly 
405 410 



ATC GTG GAG GAC TAT AGA CCA CCC TTC TAT GAT GTG GTG CCC AAT GAC 
lie Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp 
420 425 

CCC AGC TTT GAG GAC ATG AAG AAG GTG GTG TGT GTG GAT CAG CAG ACC 
Pro Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr 
435 440 



465 470 475 

ACC GCG CTG CGG ATC AAG AAG ACA CTA CAA AAA ATT AGC AAC AGT CCA 
Thr Ala Leu Arg He Lys Lys Thr Leu Gin Lys lie Ser Asn Ser Pro 
485 490 • 



l£| GAG AAG CCT AAA GTG ATT CAA 

25 Glu Lys Pro Lys Val He Gin 

lis 500 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

40 Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala 
15 10 

Leu Val Thr Gin Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val 
20 25 JU 

Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly 
35 40 45 

Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gin 
50 50 55 60 

Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg 
65 70 75 

55 Pro Thr Glu Phe Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn 

85 SO * 3 

His Asn Val Ser Leu Val Leu Glu Ala Thr Gin Pro Pro Ser Glu Gin 
100 105 11° 

60 



1296 



1344 



1392 



CCC ACC ATC CCT AAC CGG CTG GCT GCA GAC CCG GTC CTC TCA GGC CTA 
Pro Thr He Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu 
450 4 55 .' 460 

" GCT CAG ATG ATG CGG GAG TGC TGG TAC CCA AAC CCC TCT GCC CGA CTC 1440 
Ala Gin Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu 



1488 



1509 



• 



10 



15 



Pro Gly Thr Asp Gly Gin Leu Ala Leu He Leu Gly Pro Val Leu Ala 
115 120 125 

Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg 
130 135 140 

Arg Arg Gin Glu Lys Gin Arg Gly Leu His Ser Glu Leu Gly Glu Ser 
145 150 155 1^0 

Ser Leu He Leu Lys Ala Ser Glu Gin Gly Asp Thr Met Leu Gly Asp 
165 170 175 

Leu Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe 
180 185 .. 190 

Leu Val Gin Arg Thr Val Ala Arg Gin Val Ala Leu Val Glu Cys Val 
195 200 205 

= Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu 

y 20 210 215 220 

a Ser Val Ala Val Lys He Phe Ser Ser Arg Asp Glu Gin Ser Trp Phe 

1 225 230 235 240 

^ 25 Arg Glu Thr Glu He Tyr Asn Thr Val Leu Leu Arg His Asp Asn He 
: 245 250 255 

* Leu Gly Phe He Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gin 

260 265 270 

Leu Trp Leu lie Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe 
275 280 285 

Leu Gin Arg Gin Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val 
35 290 295 300 

Ser Ala Ala Cys Gly Leu Ala His Leu His Val Glu He Phe Gly Thr 
305 310 315 320 

Gin Gly Lys Pro Ala He Ala His Arg Asp Phe Lys Ser Arg Asn Val 
325 330 335 

Leu Val Lys Ser Asn Leu Gin Cys Cys He Ala Asp Leu Gly Leu Ala 
340 345 350 

Val Met His Ser Gin Gly Ser Asp Tyr Leu Asp He Gly Asn Asn Pro 
355 360 365 



30 



40 



45 



Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gin 
50 370 375 380 

He Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp He Trp Ala 
385 390 395 400 

55 Phe Gly Leu Val Leu Trp Glu He Ala Arg Arg Thr He Val Asn Gly 

405 410 415 

He Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp 
420 425 430 
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10 
15 

i s r 

S 20 

s 

« ini 

* 25 



H 3 



Pro Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr 

Pro Thr lie Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu 
.5 450 455 

Ala Gin Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu 
465 470 47i 

Thr Ala Leu Arg lie Lys Lys Thr Leu Gin Lys lie Ser Asn Ser Pro 
485 490 

Glu Lys Pro Lys Val lie Gin 
500 



35 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 
30 (B) LOCATION: 104.. 1633 *tk2' 

30 (D ) OTHER INFORMATION: /product^ "Human ALK2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CTCCGAGTAC CCCAGTCACC AGAGTGAGAG AAGCTCTGAA CGAGGGCACG CGGCTTGAAG 
GACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAAGTTGT ACA ATG GTA GAT GGA 



TGT GAAVGGT C TC TCC TCC GGT AAT GAG GAC ^ ^ *g 

Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His cys i*j.u ^xy 
40 « 

TGC TTT TCC TCA CTG AGC ATC AAC GAT GGC TTC CAC GTC TAC CAG AAA 
Cys Phe Ser Ser Leu Ser He Asn Asp Gly Phe His val Tyr ^xn y 
55 ■ 60 

GGC TGC TTC CAG GTT TAT GAG CAG GGA AAG ATG ACC TGT AAG ACC CCG 
Gly Cys Phe Gin Val Tyr Glu Gin Gly Lys Met Tnr u^ys kjt* 



60 
115 

163 



GTC ATG ATT CTT CCT GTG CTT ATC ATG ATT GCT CTC CCC TCC CCT AGT 
Val Met He Leu Pro Val Leu He Met He Ala Leu fro »e 
5 10 15 

45 ATG GAA GAT GAG AAG CCC AAG GTC AAC CCC AAA CTC TAC ATG TGT GTG 211 
Met Glu Asp Glu Lys Pro Lys Val Asn Pro Lys Leu Tyr Met Cys val 
25 30 



259 



307 



355 



60 — 70 " 75 80 
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85 9° 9b 

SSSSSS2SSS£SS5SE15SS 

105 11U 

ACA CAG AAT TTC CAC TTG GAG GTT GGC CTC ATT ATT CTC TCT GTA GTG 
T*hr Gin Asn Phe His Leu Olu Val Gly Leu lie He Leu Ser Val Val 
120 125 

TTC GCA GTA TGT CTT TTA GCC TGC CTG CTG GGA. GTT GOT CTC CGA AAA 
15 Phe Ala val Cys Leu Leu Ala Cys Leu Leu Gly Val Ala Leu Arg Lys 

135 140 

TTT AAA AGG CGC AAC CAA GAA CGC CTC AAT CCC CGA GAC GTG GAG TAT 
Phe Lys Arg Arg Asn Gin Glu Arg Leu Asn Pro Arg Asp Val Glu Tyr 



10 



20 150 



■as-; 



5 »■ SSSSSiJSSSSSSSSSSSSZSSS 

200 205 



50 



ACC CAG CTG TGG TTA ATT ACA CAT TAT CAT GAA ATG GGA TCG TTG TAG 
Thr Gin Leu Trp Leu lie Thr His Tyr His Glu Met Gly Ser Leu Tyr 
280 285 



GAC TAT CTT CAG CTT ACT ACT CTG GAT ACA GTT AGC TGC CTT CGA ATA 
55 Asp Tyr Leu Gin Leu Thr Thr Leu Asp Thr Val Ser Cys Leu Arg lie 
295 ' 300 305 

GTG CTG TCC ATA GCT ACT GGT CTT GCA CAT TTG CAC ATA GAG ATA TTT 
Val Leu Ser He Ala Ser Gly Leu Ala His Leu His He Glu lie Phe 
60 310 315 320 



403 



451 



499 



547 



595 



643 



GGC ACT ATC GAA GGG CTC ATC ACC ACC AAT GTT , GGA GAC AGC ACT TTA 
Gly Thr He Glu Gly Leu He Thr Thr Asn Val Gly Asp Ser Tnr jeu 
165 170 1 

" GCA GAT TTA TTG GAT CAT TCG TGT ACA TCA GGA AGT GGC TCT GGT CTT 691 
Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser Gly Ser Gly Leu 



739 



787 



TGT GTC GGG AAA GGC AGG TAT GGT GAG GTG TGG AGG GGC AGC TGG CAA 
35 Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp Gin 
215 220 li:> 

GGG GAA AAT GTT GCC GTG AAG ATC TTC TCC TCC CGT GAT GAG AAG TCA 
Gly Glu Asn Val Ala Val Lys He Phe Ser Ser Arg Asp Glu Lys Ser 
40 230 235 240 

TGG TTC AGG GAA ACG GAA TTG TAC AAC ACT GTG ATG CTG AGG CAT GAA 
Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met Leu Arg his wu 
245 250 255 

" AAT ATC TTA GGT TTC ATT GCT TCA GAC ATG ACA TCA AGA CAC TCC AGT 931 
Asn He Leu Gly Phe He Ala Ser Asp Met Thr Ser Arg His Ser Ser 
265 270 



835 



883 



979 



1027 



1075 
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a 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



OCO AC C CAA GGG AAA CCA GCC ATT GCC CAT CGA GAT TTA AAG AGC AAA 
Gly Thr Gin Gly Lys Pro Ala He Ala His Arg 34Q 
325 330 

AAT ATT CTG GTT AAG AAG AAT CGA CAG TGT TGC ATA CCA GAT TTG GGC 
Asn He Leu Val Lys Lys Asn Gly Gin cys 3&5 

345 ■ J3U 
CTG GCA GTC ATG CAT TCC CAG AGO ACC AAT CAG CTT GAT GTG GGG AAC 
Leu Ala Val Met His Ser Gin Ser Thr Asn ein k ^ 

360 365 
AAT CCC CGT GTG GGC ACC AAG CGC TAC ATG GCC." CCC GAA GTT CTA GAT 
Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro ^ 

375 380 
GAA ACC ATC CAG GTG ; GAT TGT TTC GAT TCT TAT AAA AGG GTC GAT ATT ; 
Glu Thr He Gin Val Asp Cys Phe Asp ser iyr w 

390 395 
TGG GCC TTT GGA CTT GTT TTG TGG GAA GTG GCC. AGG CGG ATG GTG AGC 
Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg n ^ 

AAT GGT ATA GTG GAG GAT TAC AAG CCA CCG TTC TAC GAT GTG GTT CCC 
Asn Gly He Val Glu Asp Tyr Lys Pro Pro Pne iyr v ^ 5 

425 i3V 
AAT GAC CCA AGT TTT GAA GAT ATG AGG AAG GTA GTC TGT GTG GAT CAA 
Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val Cys vai 
440 445 

CAA AGG CCA AAC ATA CCC AAC AGA phe Ser Asp Pxo Thx Leu Thx 
Gin Arg Pro Asn He Pro Asn Arg Trp Phe ser asp 
455 460 

TCT CTG GCC AAG CTA ATG AAA GAA TGC TGG TAT CAA AAT CCA TCC GCA 
Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gxn ash 

470 475 
AGA CTC ACA GCA CTG CGT ATC AAA AAG ACT TTG ACC AAA ATT GAT AAT 
Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Thr Lys ixe P ^ 

TCC CTC GAC AAA TTG AAA ACT GAC TGT TGACATTTTC ATAGTGTCAA 
Ser Leu Asp Lys Leu Lys Thr Asp Cys 

505 510 

GAAGGAAGAT TTGACGTTGT TGTCATTGTC CAGCTGGGAC CTAATGCTGG CCTGACTGGT 
TGTCAGAATG GAATCCATCT GTCTCCCTCC CCAAATGGCT GCTTTGACAA GGCAGACGTC 
GTACCCAGCC ATGTGTTGGG GAGACATCAA AACCACCCTA ACCTCGCTCG ATGACTGTGA 
ACTGGGCATT TCACGAACTG TTCACACTGC AGAGACTAAT GTTGGACAGA CACTGTTGCA 
AAGGTAGGGA CTGGAGGAAC ACAGAGAAAT CCTAAAAGAG ATCTGGGCAT TAAGTCAGTG 
GCTTTGCATA GCTTTCACAA GTCTCCTAGA CACTCCCCAC GGGAAACTCA AGGAGGTGGT 



1123 

1171 

1219 

1267 

1315 

13.63 

1411 

1459 

1507 

1555 

1603 

1650 

1710 
1770 
1830 
1890 
1950 
2010 



- 66 - 



GAATTTTTAA TCAGCAATAT TGCCTGTGCT TCTCTTCTTT ATTGCACTAG GAATTCTTTG 
CATTCCTTAC TTGCACTGTT ACTCTTAATT TTAAAGACCC AACTTGCCAA AATGTTGGCT 
5 GCGTACTCCA CTGGTCTGTC TTTGGATAAT AGGAATTCAA TTTGGCAAAA CAAAATGTAA 
TGTCAGACTT TGCTGCATTT TACACATGTG CTGATGTTTA CAATGATGCC GAACATTAGG 
10 AATTGTTTAT ACACAACTTT GCAAATTATT TATTACTTGT GCACTTAGTA GTTTTTACAA 
AACTGCTTTG TGCATATGTT AAAGCTTATT TTTATGTGGT CTTATGATTT TATTACAGAA 
ATGTTTTTAA CACTATACTC TAAAATGGAC ATTTTCTTTT ,'ATTATCAGTT AAAATCACAT 
" TTTAAGTGCT TCACATTTGT ATGTGTGTAG ACTGTAACTT TTTTTCAGTT CATATGCAGA 
„ ACGTATTTAG CCATTACCCA CGTGACACCA : CCGAATATAT TATCGATTTA GAAGCAAAGA 

3 20 TTTCAGTAGA ATTTTAGTCC TGAACGCTAC GGGGAAAATG CATTTTCTTC AGAATTATCC 
| ATTACGTGCA TTTAAACTCT GCCAGAAAAA AATAACTATT TTGTTTTAAT CTACTTTTTG 

?1 TATTTAGTAG TTATTTGTAT AAATTAAATA AACTGTTTTC AAGTCAAAAA AAAA 

Zl 25 

k| (2) INFORMATION FOR SEQ ID NO: 4: 

' (i) SEQUENCE CHARACTERISTICS: 

N" 30 (A) LENGTH: 509 amino acids 

O (B) TYPE: amino acid 

lI (D) TOPOLOGY: linear 

S (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Asp Gly Val Met lie Leu Pro Val Leu lie Met lie Ala Leu 
1 5 10 

Pro Ser Pro Ser Met Glu Asp Qlu Lys Pro Lys Val Asn Pro Lys Leu 
20 25 

Tyr Met Cys Val Cys Glu Gly Leu Ser Cys Gly Asn Glu Asp His Cys 
35 40 * 3 

Glu Gly Gin Gin Cys Phe Ser Ser Leu Ser He Asn Asp Gly Phe His 
50 55 60 

Val Tyr- Gin Lys Gly Cys Phe Gin Val Tyr Glu Gin Gly Lys Met Thr 
65 70 75 

Cys Lys Thr Pro Pro Ser Pro Gly Gin Ala Val' Glu Cys Cys Gin Gly 
85 90 

Asp Trp Cys Asn Arg Asn lie Thr Ala Gin Leu Pro Thr Lys Gly Lys 
100 105 

Ser Phe Pro Gly Thr Gin Asn Phe His Leu Glu Val Gly Leu He He 
60 115 120 125 



35 



40 



45 



50 



55 



2610 
2670 
2724 
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Leu Ser 
130 



Val Val Phe Ala Val Cys Leu Leu Ala Cys Leu Leu Gly Val 



135 



140 



Ala Leu Arg Lys Phe Lys Arg Arg Asn Gin Glu Arg Leu Asn Pro Arg 



145 150 155 



Asp 



Val Glu Tyr Gly Thr He Glu Gly Leu He Thr Thr Asn Val Gly 



165 



170 



Asp Ser 



Thr Leu Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser 



180 



185 



Gly Ser Gly Leu Pro Phe Leu Val Gin Arg Thr .-Val Ala Arg Gin He 
195 ' 200 ' 205 

Thr Leu Leu Glu Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg 
210 215 220 

Gly Ser Trp Gin Gly Glu Asn Val Ala Val Lys He Phe Ser Ser Arg 



225 



230 



235 



Asp Glu Lys Ser Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met 



245 



250 



Leu Arg His Glu Asn He Leu Gly Phe He Ala Ser Asp Met Thr Ser 



260 265 

Arg His Ser Ser Thr Gin Leu Trp Leu He Thr His Tyr His Glu Met 

- "* 285 



275 



280 



Gly Ser Leu Tyr Asp Tyr Leu Gin Leu Thr Thr Leu Asp Thr Val Ser 
290 295 30° 

Cys Leu Arg He Val Leu Ser He Ala Ser Gly Leu Ala His Leu His 



305 



310 



315 



He Glu He Phe Gly Thr Gin Gly Lys Pro Ala He Ala His Arg Asp 
325 330 . 335 

Leu Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Gin Cys Cys He 



340 



345 



Ala Asp Leu Gly Leu Ala Val Met His Ser Gin Ser Thr Asn Gin Leu 
355 360 365 

Asp Val Gly Asn Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro 
370 375 380 

Glu Val '.Leu Asp Glu Thr He Gin Val Asp Cys Phe Asp Ser Tyr Lys 



385 



390 



395 



Arg Val Asp He Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg 



405 



410 



Arg Met Val Ser Asn Gly He Val Glu Asp Tyr Lys Pro Pro Phe Tyr 
420 425 43 ° 

Asp Val val Pro Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val 
435 440 445 



• ... • 



a 3 



5 3 3 



a-j. 



Cys Val Asp Gin Gin Arg Pro Asn lie Pro Asn Arg Trp Phe Ser Asp 
450 455 
, Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gin 
465 470 

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Thr 



10 



^ 25 



40 



485 *9° 495 



Lys He Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys 
500 505 



15 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
q (A) LENGTH: 2932 base pairs 

~f, (B) TYPE: nucleic acid 

*f 2 o (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 310.. 1908 » T y». 
(D) OTHER INFORMATION: /product= "Human ALKJ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCTCCGCGCC GAGGGCTGGA GGATGCGTTC CCTGGGGTCC GGACTTATGA AAATATGCAT 
CAGTTTAATA CTGTCTTGGA ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTTGGAGAA 
AATCAGAAGT ACAGTTTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA GCAGTGGGAG 
TTGAAGTCAT TGTCAAGTGC TTGCGATCTT TTACAAGAAA ATCTCACTGA ATGATAGTCA 
TTTAAATTGG TGAAGTAGCA AGACCAATTA TTAAAGGTGA CAGTACACAG GAAACATTAC 



AATTGAACA ATG ACT CAG CTA TAC ATT TAC ATC AGA TTA TTG GGA GCC 
45 Met Thr Gin Leu Tyr He Tyr He Arg Leu Leu Gly Ala 

1 5 10 

TAT TTG TTC ATC ATT TCP CGT GTT CAA GGA CAG AAT CTG GAT ACT ATG 
Tyr Leu Phe He He Ser Arg Val Gin Gly Gin Asn Leu Asp Ser Met 
50 15. 20 25 

CTT CAT GGC ACT GGG ATG AAA TCA GAC TCC GAC CAG AAA AAG TCA GAA 

Leu His Gly Thr Gly Met Lys Ser Asp Ser Asp Gin Lys Lys Ser Glu 
30 35 40 

" AAT GGA GTA ACC TTA GCA CCA GAG GAT ACC TTG CCT TTT TTA AAG TGC 492 
Asn Gly Val Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys cys 
50 55 °V 



60 



TAT TGC TCA GGG CAC TGT CCA GAT GAT GCT ATT AAT AAC. ACA TGC ATA 



240 
300 
348 

396 

444 



540 



- 69 - 



20 



25 



U 30 



Tyr Cys Ser Gly His Cys Pro Asp Asp Al. He Asn Asn Thr Cys He 

ACT AAT GGA CAT TGC TTT GCC ATC ATA GAA GAA GAT GAC CAG GGA GAA 
5 Thr Asn Gly His Cys Phe Ala lie lie giu oiu v k 
80 85 

- t r* s ss^ssi^sis s Asp s a; 

Thr Thr Leu Ala Ser Gly cys weu lj= j 
10 95 100 

TGC AAA GAT TCT CCA AAA GCC CAG CTA CGC CGG ACA ATA GAA TGT TGT 
Cys Lys Asp Ser Pro Lys Ala Gin Leu Arg Arg Thr lie fai y ^ 
110 115 
" CGG ACC AAT TTA TGT AAC CAG TAT TTG CAA GCC ACA CTG CCC CCT GTT 
Arg Thr Asn Leu Cys Asn Gin Tyr Leu Gin Pro Tnr urn 

130 135 
GTC ATA GGT CCG TTT TTT GAT GGC AGC ATT CGA TGG CTG GTT TTG CTC 
Val He Gly Pro Phe Phe Asp Gly Ser lie Arg irp ^ 

145 150 
ATT TCT ATG GCT GTC T^C ATA ATT GCT ATG ATC ATC TTC TCC AGC TGC 
He Ser Met Ala Val Cys He He Ala Met He lie rne 

160 165 
TTT TGT TAC AAA CAT TAT TGC AAG AGC ATC TCA AGC AGA CGT CGT TAC 
Phe Cys Tyr Lys His Tyr Cys Lys Ser He Ser Ser Arg Arg Arg 

175 180 1B: > 

AAT CGT GAT TTG GAA CAG GAT GAA GCA TTT ATT CCA GTT GGA GAA TCA 
Asn Arg Asp Leu Glu Gin Asp Glu Ala Phe He Pro vai y ^ 
U 190 195 

M " CTA AAA GAC CTT ATT GAC CAG TCA CAA AGT TCT GGT AGT GGG TCT GGA 
Leu Lys Asp Leu He Asp Gin Ser Gin Ser Ser Gly &er v, y ^ 

210 215 
CTA CCT TTA TTG GTT CAG CGA ACT ATT GCC AAA CAG ATT CAG ATG GTC 
Leu Pro Leu Leu Val Gin Arg Thr lie Ala Lys Gin lie 

225 230 
CGG CAA GTT GGT AAA GGC CGA TAT GGA GAA GTA TGG ATG GGC AAA TGG 
Arg Gin Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Met tiy y 
240 245 

CGT GGC GAA AAA GTG GCG GTG AAA GTA TTC TTT ACC ACT GAA GAA GCC 
Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu giu a 



40 



45 



50 



255; " 260 265 



588 



636 



684 



732 



780 



828 



876 



924 



972 



1020 



1068 



1116 



1164 



AGC TGG TTT CGA GAA ACA GAA ATC TAC CAA ACT CTG CTA ATG CGC CAT 
Ser Trp Phe Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg 
270 275 280 

55 GAA AAC ATA CTT GGT TTC ATA GCG GCA GAC ATT AAA GGT ACA GGT TCC 1212 
Glu Asn He Leu Gly Phe He Ala Ala Asp He Lys Gly Tnr ^y 
290 295 



60 



TGG ACT CAG CTC TAT. TTG ATT ACT GAT TAC CAT GAA AAT GGA TCT CTC 



1260 



• ... • 



10 



15 



20 



25 



U 30 



35 



40 



45 



50 



55 



Trp Thr Gin Leu Tyr Leu lie Thr Asp Tyr His Glu Asn Gly Ser Leu 
305 310 

J-SSSSSS5SS222SSS 

335 340 3 " 

TAT GGC ACC CAA GGA AAG CCC OCA ATT OCT CAT CGA GAC CTA AAG AGC 

Tyr Gly Thr Gin Gly Lys Pro Ala He Ala His Arg asp * ^ 

350 355 360 -- 

AAA AAC ATC CTC AAG AAA AAT GGG AGT TGC TGC ATT GCT GAC CTG 

Lys Asn He Leu He Lys Lys Asn Gly Ser cys Lys a. 
370 375 

GGC CTT GCT GTT AAA TTC AAC AGT GAC ACA AAT GAA GTT GAT GTG CCC 
Gly Leu Ala Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp 

TTG AAT ACC AGG GTG GGC ACC AAA CGC TAC ATG GCT CCC GAA GTG CTG 
Leu Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Ala Pro giu 

400 405 
GAC GAA AOC CTG AAC AAA AAC CAC TTC CAG CCC TAC ATC ATG GCT GAC 
Asp Glu Ser Leu Asn Lys Asn His Phe Gin Pro iyr 
415 420 

ATC TAC AGC TTC GGC CTA ATC ATT TGG GAG ATG GCT CGT CGT TGT ATC 
He Tyr Ser Phe Gly Leu He He Trp Glu Met Ala Arg Arg ^ ^ 
430 435 

ACA GGA GGG ATC GTG GAA GAA TAC CAA TTG CCA TAT TAC AAC ATG GTA 
Thr Gly Gly He Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Met va 
450 455 

CCG AGT GAT CCG TCA TAC GAA GAT ATG CGT GAG GTT GTG TGT GTC AAA 
Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys val i^ys 
465 4 ?0 4/3 

CGT TTG CGG CCA ATT GTG TCT AAT CGG TGG AAC AGT GAT GAA TGT CTA 
Arg Leu Arg Pro He Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu 

480 485 
CGA GCA GTT TTG AAG CTA ATG TCA GAA TGC TGG GCC CAC AAT CCA ; GCC 
Arg Ala Val Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn rro 
495', 500 505 

TCC AGA CTC ACA GCA .TTG AGA ATT AAG AAG ACG CTT GCC AAG ATG GTT 
Ser Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Ala Lys wee v 
510 515 520 

GAA TCC CAA GAT GTA AAA ATC TGATGGTTAA ACCATCGGAG GAGAAACTCT 
Glu Ser Gin Asp Val Lys He 
530 



60 AGACTGCAAG AACTGTTTTT ACCCATGGCA TGGGTGGAAT TAGAGTGGAA TAAGGATGTT 



1308 



1356 



1404 



1452 



1500 



1548 



1596 



1644 



1692 



1740 



1788 



1836 



1884 



1935 



1995 



? 3 



10 



20 



5 >o 



45 



AACTTGGTTC TCAGACTCTT TCTTCACTAC GTGTTCACAG GCTGCTAATA TTAAACCTTT 
CAGTACTCTT ATTAGGATAC AAGCTGGGAA CTTCTAAACA CTTCATTCTT TATATATGGA 
CAGCTTTATT TTAAATGTGG TTTTTGATGC CTTTTTTTAA GTGGGTTTTT ATGAACTGCA 
TCAAGACTTC AATCCTGATT AGTGTCTCCA GTCAAGCTCT GGGTACTGAA TTGCCTGTTC 
ATAAAACGGT GCTTTCTGTG AAAGCCTTAA GAAGATAAAT GAGCGCAGCA GAGATGGAGA 
AATAGACTTT GCCTTTTACC TGAGACATTC AGTTCGTTTG TATTCTACCT TTGTAAAACA 
GCCTATAGAT GATGATGTGT TTGGGATACT GCTTATTTTA .TG ATAGTTTG TCCTGTGTCC 
TTAGTGATGT GTGTGTGTCT CCATGCACAT GCACGCCGGG ATTCCTCTGC TGCCATTTGA 
ATTAGAAGAA AATAATTTAT ATGCATGCAC AGGAAGATAT TGGTGGCCGG TGGTTTTGTG 
CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTGCA 
AGTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 
AGAAGTTTAA AGCATCTGTA AATTTGGACT GTTTTCCTTC AACCACCATT TTTTTTGTGG 
TTATTATTTT TGTCACGGAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT GCCATGAACC 
ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT GCATTTGATA GCAATGTAAG 
TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGGG AAGTTATTTA 
TATTTTGTGT ATAATGTGCT TTATTTGCAA ATCACCC 



35 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Thr Gin Leu Tyr He Tyr He Arg Leu Leu Gly Ala Tyr Leu Phe 
1 5 10 

He He Ser Arg Val Gin Gly Gin Asn Leu Asp Ser Met Leu His Gly 
50 20 25 30 

Thr Gly Met Lys Ser Asp Ser Asp Gin Lys Lys Ser Glu Asn Gly Val 
35 40 45 

55 Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser 
50 55 60 

Gly His Cys Pro Asp Asp Ala He Asn Asn Thr Cys He Thr Asn Gly 
65 70 75 



2895 
2932 



m 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



His Cys Phe Ala lie He Glu Glu Asp Asp Gin Gly Glu Thr Thr Leu 



85 90 95 



Ala ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gin Cys Lys Asp 
100 105 liU 

Ser Pro Lys Ala Gin Leu Arg Arg Thr He Glu Cys Cys Arg Thr Asn 
115 120 liSS 

Leu Cys Asn Gin Tyr Leu Gin Pro Thr Leu Pro Pro Val Val He Gly 



130 



135 



Pro Phe Phe Asp Gly Ser lie Arg Trp Leu Val Leu Leu He Ser Met 
145 150 155; 

Ala Val Cys He lie Ala Met He He Phe Ser Ser Cys Phe Cys Tyr 



165 



170 



Lys His Tyr Cys Lys Ser lie Ser Ser Arg Arg Arg Tyr Asn Arg Asp 



180 



185 



Leu Glu Gin Asp Glu Ala Phe He Pro Val Gly.Glu Ser Leu Lys Asp 
195 200 205 

Leu He Asp Gin Ser Glri Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu 
210 215 220 

Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Arg Gin Val 
225 230 "5 ^ 

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg Gly Glu 



245 



250 



Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe 



260 



265 



Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg His Glu Asn He 
275 280 2 ° 5 

Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly Ser Trp Thr Gin 
290 295 300 

Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe 



305 



310 



315 



Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 



325 



330 



Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu He Tyr Gly Thr 
340 345 350 

Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys Ser Lys Asn He 
355 360 365 

Leu He Lys Lys Asn Gly Ser Cys Cys He Ala Asp Leu Gly Leu Ala 
370 375 380 

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro Leu Asn Thr 



385 



390 



395 



20 



Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 
405 410 

Leu Asn Lys Asn His Phe Gin Pro Tyr lie Met Ala Asp lie Tyr Ser 
5 420 425 «-> u 

Phe Gly Leu He lie Trp Glu Met Ala Arg Arg Cys lie Thr Gly Gly 

435 440 443 

10 He Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp 
450 455 460 

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 47 V 

Pro lie Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val 
485 490 

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu 
500 505 310 

Thr Ala Leu Arg lie Lys Lys Thr Leu Ala Lys. Met Val Glu Ser Gin 
515 520 525 



15 



25 Asp Val Lys He 
530 



30 



(2) INFORMATION FOR SEQ ID NO: 7: 



_ (i) SEQUENCE CHARACTERISTICS: 

M: (A) LENGTH: 1952 base pairs 

fft (B) TYPE: nucleic acid 

:£f (C) STRANDEDNESS : single 

W 35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

40 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 187.. 1695 . 
(D) OTHER INFORMATION: /product* "Murine ALKb 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AAGCGGCGGC AGAAGTTGCC GGCGTGGTGC TCGTAGTGAG GGCGCGGAGG ACCCGGGACC 

50 TGGGAAGCGG CGGCGGGTTA ACTTCGGCTG AATCACAACC ATTTGGCGCT GAGCTATGAC 

AAGAGAGCAA ACAAAAAGTT AAAGGAGCAA CCCGGCCATA AGTGAAGAGA GAAGTTTATT 180 

GATAAC ATG CTC TTA CGA AGC TCT GGA AAA TTA AAT GTG GGC ACC AAG 
55 Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys 

1 • 5 10 



AAG GAG GAT GGA GAG AGT ACA GCC CCC ACC CCT CGG CCC AAG ATC CTA 
Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys He Leu 
60 15 20 25 



60 
120 



228 



276 



• _ 74 . • 



10 



15 



Jl 20 



U 30 



n 35 



40 



CGT TGT AAA TGC CAC CAC CAC TGT CCG GAA GAC TCA GTC AAC AAT ATC 
Arg Cys Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn lie 
35 40 " 



TCC AGC ACA GAT GGG TAC TGC TTC ACG ATG ATA GAA GAA GAT GAC TCT 
Ser 5£ Sp §S Sr Cys Phe Thr Met lie Glu Glu Asp Asp Ser 

50 55 

GGA ATG CCT GTT GTC ACC TCT GGA TGT CTA GGA CTA GAA GGG TCA GAT 
Gly Met Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp 
65 70 75 

TTT CAA TGT CGT GAC ACT CCC ATT CCT CAT CAA AGA AGA TCA ATT GAA 
Phe Gin Cys Arg Asp Thr Pro lie Pro His Gin Arg Arg Ser lie Glu 
80 85 90 

TGC TGC ACA GAA AGG AAT GAG TGT AAT AAA GAC CTC CAC CCC ACT CTC 
Cys Cys Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu 
95 100 105 



GCC TTla LTT Alt- JA-1 v»±\» Av_i uiv. 7 , . - Tl - 

Ala Leu Leu lie Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu lie 
135 14U 



130 



ATT TTA TTC TGT TAC TTC AGG TAT AAA AGA CAA GAA GCC CGA CCT CGG 
He Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gin Glu Ala Arg Pro Arg 
145 150 155 

TAC AGC ATT GGG CTG GAG CAG GAC GAG ACA TAC ATT CCT CCT GGA GAG 
Tyr Ser He Gly Leu Glu Gin Asp Glu Thr Tyr He Pro Pro Gly Glu 
160 165 I 70 



Val Lys Gin He Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys 
210 215 " u 



50 TGG CGT GGA GAA AAG GTG GCT GTG AAA GTG TTC TTC ACC ACG GAG GAA 
Trp Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu 
225 230 235 

GCC AGC TGG TTC CGA GAG ACT GAG ATA TAT CAG ACG GTC CTC ATC CGG 
55 Ala Ser Trp Phe Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg 
240 245 250 



CAT GAG AAT ATT CTG GGG TTC ATT GCT GCA GAT ATC AAA GGG ACT GGG 
His Glu Asn He Leu Gly Phe He Ala Ala Asp He Lys Gly Thr Gly 
60 255. 260 265 i,yi 



324 



372 



420 



468 



516 



564 



* CCT CCT CTC AAG GAC AGA GAT TTT GTT GAT GGG. CCC ATA CAC CAC AAG 

00 pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro He His His Lys 

ru us 120 

? 25 GCC TTG CTT ATC TCT GTG ACT GTC TGT ACT TTA CTC TTG GTC CTC ATT 612 



660 



708 



756 



TCC CTG AGA GAC TTG ATC GAG CAG TCT CAG AGC TCG GGA AGT GGA TCA 
Ser Leu Arg Asp Leu He Glu Gin Ser Gin Ser Ser Gly Ser Gly Ser 
175 180 185 1SU 

GGC CTC CCT CTG CTG GTC CAA AGG ACA ATA GCT AAG CAA ATT CAG ATC 
Gly Leu Pro Leu Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met 
195 200 zua 

" GTG AAG CAG ATT GGA AAA GGC CGC TAT GGC GAG GTC TGG ATG GGA AAG 852 



804 



900 



948 



996 



TCC TGG ACT CAG TTG TAC CTC ATC ACA GAC TAT CAT GAA AAC GGC TCC 
2r £p Thr Gin Leu Tyr Leu lie Thr Asp Tyr Hi. Olu Asn Gly Ser 

275 20U 

5 CTT TAT GAC TAT CTG AAA TCC ACC ACC TTA GAC GCA AAG TCC ATG CTG 
Leu Tyr Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met: 
290 295 

305 310 

ATC TTT AGC ACT CAA GGC AAG CCA GCA ATC GCC CAT CGA GAC TTG AAA 
15 He Phe Ser Thr Gin Gly Lys Pro Ala He Ala His Arg Asp neu y 
320 325 330 

ACT AAA AAC ATC CTG GTG AAG AAA AAT GGA ACT TGC TGC ATA GCA GAC 
sir Lys Asn lie Leu Val Lys Lys Asn Gly Thr Cys Cys He Ala Asp 
335 340 34b 

CTG GGC TTG GCT GTC AAG TTC ATT ACT GAC ACA. AAT GAG GTT GAC ATC 
Leu Gly Leu Ala Val Lys Phe lie Ser Asp Thr Asn Glu Val Asp lie 
355 360 



\D 20 335 



H 30 



.35 



CTG GAC GAG AGC TTG AAT AGA AAC CAT TTC CAB ■«£ *£ «" — ~- 
Leu Asp Glu Ser Leu Asn Arg Asn His Phe Gin Ser Tyr He Met 
385 390 

GAC ATG TAC AGC TTT GGA CTC ATC CTC TGG GAG ATT Arc Arg £s 

Asp Met Tyr Ser Phe Gly Leu He Leu Trp Glu He Ala Arg Arg cys 

405 410 



400 



AAG AAG TTA CGG CCT TCA TTC CCC. AAT CGA TGG AGC ACT GAT GAG TGT 
Lys Lys Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys 

CTC AGG CAG ATG GGG AAG CTT ATG ACA GAG TGC TGG GCG CAG AAT CCT 
Leu Arg Gin Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gin Asn Pro 
465 470 475 

GCC TCC AGG CTG ACG GCC CTG AGA GTT AAG AAA ACC CTT GCC AAA ATG 
Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met 

480 485 490 

TCA GAG TCC CAG GAC ATT AAA CTC TGACGTCAGA TACTTGTGGA CAGAGCAAGA 
Ser Glu Ser Gin Asp He Lys Leu 
60 495 500 



50 



55 



1044 



1092 



1140 



1188 



1236 



1284 



P " CCA CCC AAC ACC CGG GTT GGC ACC AAG CGC TAT ATG CCT CCA GAA GTG 1332 
^ Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro giu v 

li 370 375 ■ }0U 

CTG GAC GAG AGC TTG AAT AGA AAC CAT TTC CAG TCC TAC ATT ATG GCT 
Glu Ser Leu Asn Arg Asn 
385 390 



1380 



1428 



1476 



GTT TCT GGA GGT ATA GTG GAA GAA TAC CAG CTT CCC TAT CAC GAC CTG 
Val Ser Gly Gly He Val Glu Glu Tyr Gin Leu Pro Tyr His Asp Leu 
40 415 420 425 

GTG CCC ACT GAC CCT TCT TAT GAG GAC ATG AGA CAA ATT GTG TGC ATG 
Val Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu He Val Cys Met 
435 440 

45 .no hot «&T GAG TGT 1572 



1524 



1620 



1668 



1722 



- 76 - 



10 



ATTTCACAGA AGCATCGTTA GCCCAAGCCT TGAACGTTAG CCTACTGCCC AGTGAGTTCA 1782 

GACTTTCCTG GAAGAGAGCA CGGTGGGCAG ACACAGAGGA ACCCAGAAAC ACGGATTCAT 1842 

CATGGCTTTC TGAGGAGGAG AAACTGTTTG GGTAACTTGT TCAAGATATG ATGCATGTTG 1902 
CTTTCTAAGA AAGCCCTGTA TTTTGAATTA CCATTTTTTT ATAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) t SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: protein 

y3 20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

S Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val . Gly Thr Lys Lys Glu 

^ 1 5 10 15 

W 25 Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys He Leu Arg Cys 
£ ■ 20 25 JU 

W Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn He Cys Ser 

s 35 40 43 

G Thr Asp Gly Tyr Cys Phe Thr Met He Glu Glu Asp Asp Ser Gly Met 

M. 50 55 60 

? Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp Phe Gin 

O 35 65 70 75 

Cys Arg Asp Thr Pro He Pro His Gin Arg Arg Ser He Glu Cys Cys 
85 90 " 

40 Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu Pro Pro 
100 105 lxyj 

Leu Lys Asp Arg Asp Phe Val Asp Gly Pro He His His Lys Ala Leu 
115 120 125 

45 Leu He Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu He He Leu 
130 135 I 40 

Phe Cys Tyr Phe Arg Tyr Lys Arg Gin Glu Ala Arg Pro Arg Tyr Ser 
50 145 150 155 

He Gly Leu Glu Gin Asp Glu Thr Tyr He Pro Pro Gly Glu Ser Leu 
165 170 175 

55 Arg Asp Leu He Glu Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu 
180 185 iSU 

Pro Leu Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Lys 
195 200 205 



1952 



77 - 



45 



Gin lie Gly Lys Gly Arg Tyr Gly Olu Val Trp Met Oly Lys Trp Arg 

210 215 ^ 

Gly Olu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser 



225 



230 



Trp Phe Arg Glu Thr Glu lie Tyr Gin Thr Val Leu Met Arg His Glu 

245 250 

10 Asn lie Leu Gly Phe lie Ala Ala Asp He Lys Gly Thr Gly Ser Trp 

260 z " 

Thr Gin Leu Tyr Leu lie Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr 
275 280 -• 

" Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu 
290 295 300 

Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu lie Phe 
20 305 310 31: > 

Ser Thr Gin Gly Lys Pro Ala lie Ala His Arg. Asp Leu Lys Ser Lys 
325 330 

25 Asn lie Leu Val Lys Lys Asn Gly Thr Cys Cys He Ala Asp Leu Gly 
340 345 
Leu Ala Val Lys Phe lie Ser Asp Thr Asn Glu Val Asp lie Pro Pro 
355 3 60 

30 Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val Leu Asp 
370 375 380 

Glu Ser Leu Asn Arg Asn His Phe Gin Ser Tyr He Met Ala Asp Met 
35 385 390 395 

Tyr Ser Phe Gly Leu He Leu Trp Glu lie Ala Arg Arg Cys Val Ser 
405 410 

40 Gly Gly He Val Glu Glu Tyr Gin Leu Pro Tyr His Asp Leu Val Pro 
420 425 
Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu He Val Cys Met Lys Lys 



435 440 

Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu' Cys Leu Arg 
450 455 460 

Gin Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gin Asn Pro Ala Ser 
50 465 470 475 

Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met Ser Glu 
485 490 

55 Ser Gin Asp He Lys Leu 
500 



(2) INFORMATION FOR SEQ ID NO: 9: 



60 



15 



"« 20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
10 (A) NAME /KEY: CDS 

(B) LOCATION: 49.. 1341 « OSTEOGENIC PROTEIN" 

(D) OTHER INFORMATION: /function^ OSTEOGENIC wuiw 

/product = "OPl" 
/evidence= EXPERIMENTAL 
/standard_name= "OPl" 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 



1 



- 25 gg2SS2SSS2S2S22SS 

■+= 5 10 



CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC 
Se £E 22 Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn 



€0 



135 140 
CAC CCA CGC TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC 



57 



rd ™r r.rrr. OCG CTC TGG GCA 105 



153 



201 



ji 30 20 25 

5 GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG 

« Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser Gin Olu Arg 

K 35 CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC 249 
Arg Glu Met Gin Arg Glu He Leu Ser lie Leu Gly Leu Pro His Arg 
55 60 . " 



297 



CCG CGC CCG CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TTC ATG 
Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro Met Pne Mec 
70 75 HU 

CTG GAC CTG TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC 
45 Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Gly 
85 90 " 

GGC CAG GGC TTC TCC TAC CCC TAC AAG GCC GTC- TTC AGT ACC CAG GGC 
Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Tftr ej.n ^xy 
50 100 105 11° 

CCC CCT CTG GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC 
Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp 
120 125 "° 

" ATG GTC ATG AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC 489 
Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu pne rne 



345 



393 



441 



537 



m 



o 



20 



40 



45 



His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He 
150 155 



CAC AGC CGT ACC CTC TCG GCC TCC GAG GAG GGC TCC . CTG GTG TTT GAC 
Asp. Ser Arg Thr Leu Trp Ala Ser Glu Giu ^J-y " 
215 220 

ATC ACA GCC ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG 
He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn m 
230 235 

1 GGC CTG CAG CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC 

in 25 Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser lie Asn Pro 

-£ 245 250 

U AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC 

£s lS Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn Lys Gin Pro 
\± 30 260 265 270 

TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC 
l£e Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser He 
280 285 



60 



AAC GCC ACC AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC 
Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Phe lie Asn 
375 380 385 

CCG GAA ACG GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC 



585 



CCA GAA GGG GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC 
5 Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr Lys a p 
165 l™ 17b 

TAC ATC CGG GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT 
Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg He Ser Val Tyr 
10 180 185 19° 

CAG GTG CTC CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC 
Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu 
200 205 
15 ™~ nnr TOG CTG GTG TTT GAC 729 



633 



681 



777 



825 



873 



921 



" CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC 969 
Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Tnr 



295 300 

AAG AAC CAG GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC 
Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn ser 
310 315 "° 

AGC GAC CAG AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC 
Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val ser pne 
325 330 335 

CGA GAC CTG GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC 
Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu Gly Tyr Aia 
50 340 345 350 

GCC TAC TAC TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG 
Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met 
360 365 
55 ^mo nun mv' ATC AAC 1209 



1017 



1065 



1113 



1161 



1257 



10 420 



tn 20 



30 



35 



45 



50 



Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala 

390 395 
ATC TCC GTC CTC TAG TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA 
lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu y 

405 410 
TAG AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 
g£ Jrg £n Mel Val Val Arg Ala Cys Oly Cys His 



GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTCG CCTTGGCCAG 
GAACCAGCAG ACCAACTGCC TTTTGTGAGA CCTTCCCCTC CCTATCCCCA AC TTTAAAGG 
TGTGAGAGTA TTAGGAAACA TGAGCAGCAT ATGGCTTTTG ATCAGTTTTT CAGTGGCAGC 
ATCCAATGAA CAAGATCCTA CAAGCTGTGC AGGCAAAACC TAGCAGGAAA AAAAAACAAC 
GCATAAAGAA AAATGGCCGG GCCAGGTCAT TGGCTGGGAA GTCTCAGCCA TGCACGGACT 
CGTTTCCAGA GGTAATTATG AGCGCCTACC AGCCAGGCCA CCCAGCCGTG GGAGGAAGGG 
GGCGTGGCAA GGGGTGGGCA CATTGGTGTC TGTGCGAAAG GAAAATTGAC CCGGAAGTTC 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

40 Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

1-5 10 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 

Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 
35 40 

Gin Glu Arg Arg Glu Met Gin Arg Glu lie Leu Ser lie Leu Gly Leu 

50; 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 

55 Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 

Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 
100 105 



1305 



1351 



25 CTGTAATAAA TGTCACAATA AAACGAATGA ATGAAAAAAA AAAAAAAAAA A 1822 
(2) INFORMATION FOR SEQ ID NO: 10: 



• ... • 



Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 



115 



120 



Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 
130 135 



Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 
145 150 155 

X0 Ser Lys He Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He 

Tyr Lys Asp Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg lie 



180 



" Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 



Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 



20 



210 " 215 220 



* val Phe Asp lie Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg 

m 225 230 235 

m 25 His Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser 

Zl 245 " u 

W lie Asn Pro Lys Leu Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn 

3 260 265 

Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 

275 280 
Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 

290 295 300 

Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 
305 310 315 

40 Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 

325 330 
Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu 



30 



35 



340 345 



45 



Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 - 303 

Ser Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His 
50 370 375 380 

Phe He Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin 
385 390 395 

55 Leu Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He 

405 410 

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 * JW 



- 82 - 



10 



15 



y3 20 



3 

U 30 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label* op -X selected from 

/note= "Each Xaa is independently selecteairom 
a group of one or more specified amino acids 
as defined in the specification" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Cys Xaa Xaa His Glu Leu Tyr Val Xaa Phe Xaa Asp Leu Gly Trp Xaa 
yj 1 5 10 

| 25 Asp Trp Xaa lie Ala Pro Xaa Gly Tyr Xaa Ala Tyr Tyr Cys Glu Gly 

£ Glu Cys Xaa Phe Pro Leu Xaa Ser Xaa Met Asn Ala Thr Asn His Ala 

35 40 D 

lie Xaa Gin Xaa Leu Val His Xaa Xaa Xaa Pro Xaa Xaa Val Pro Lys 
S 50 55 60 

m Xaa Cys Cys Ala Pro Thr Xaa Leu Xaa Ala Xaa Ser Val Leu Tyr Xaa 

□ 35 65 70 75 

S Asp Xaa Ser Xaa Asn Val Xaa Leu Xaa Lys Xaa Arg Asn Met Val Val 

85 90 

40 Xaa Ala Cys Gly Cys His 

100 

(2) INFORMATION FOR SEQ ID NO: 12: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
55 (iv) ANTI-SENSE: NO 



(xi) 

60 



SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



• S3- # 



10 



2 8 

GCGGATCCTG TTGTGAAGGN AATATGTG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

20 24 
GCGATCCGTC GCAGTCAAAA TTTT 

(2) INFORMATION FOR SEQ ID NO: 14: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

40 26 
GCGGATCCGC GATATATTAA AAGCAA 

(2) INFORMATION FOR SEQ ID NO: 15: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
55 (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

20 

60 CGGAATTCTG GTGCCATATA 



